NSF PAR Search | NSF Public Access Repository

Generalizing to Unseen Domains via Text-Guided Augmentation: A Training-Free Approach

https://doi.org/10.1007/978-3-031-72890-7_17

Qi, Daiqing; Zhao, Handong; Zhang, Aidong; Li, Sheng (December 2024, The European Conference on Computer Vision)

Full Text Available
MedCite: Can Language Models Generate Verifiable Text for Medicine?

https://doi.org/10.18653/v1/2025.findings-acl.967

Wang, Xiao; Tan, Mengjue; Jin, Qiao; Xiong, Guangzhi; Hu, Yu; Zhang, Aidong; Lu, Zhiyong; Zhang, Minjia (January 2025, Association for Computational Linguistics)

Full Text Available
On the Role of Server Momentum in Federated Learning

Sun, Jianhui; Wu, Xidong; Huang, Heng; Zhang, Aidong (February 2024, Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI 2024))

Full Text Available
DeepGSEA: explainable deep gene set enrichment analysis for single-cell transcriptomic data

https://doi.org/10.1093/bioinformatics/btae434

Xiong, Guangzhi; LeRoy, Nathan J; Bekiranov, Stefan; Sheffield, Nathan C; Zhang, Aidong (July 2024, Bioinformatics)
Wren, Jonathan (Ed.)

Abstract
Motivation: Gene set enrichment (GSE) analysis allows for an interpretation of gene expression through pre-defined gene set databases and is a critical step in understanding different phenotypes. With the rapid development of single-cell RNA sequencing (scRNA-seq) technology, GSE analysis can be performed on fine-grained gene expression data to gain a nuanced understanding of phenotypes of interest. However, with the cellular heterogeneity in single-cell gene profiles, current statistical GSE analysis methods sometimes fail to identify enriched gene sets. Meanwhile, deep learning has gained traction in applications like clustering and trajectory inference in single-cell studies due to its prowess in capturing complex data patterns. However, its use in GSE analysis remains limited, due to interpretability challenges.
Results: In this paper, we present DeepGSEA, an explainable deep gene set enrichment analysis approach which leverages the expressiveness of interpretable, prototype-based neural networks to provide an in-depth analysis of GSE. DeepGSEA learns the ability to capture GSE information through our designed classification tasks, and significance tests can be performed on each gene set, enabling the identification of enriched sets. The underlying distribution of a gene set learned by DeepGSEA can be explicitly visualized using the encoded cell and cellular prototype embeddings. We demonstrate the performance of DeepGSEA over commonly used GSE analysis methods by examining their sensitivity and specificity with four simulation studies. In addition, we test our model on three real scRNA-seq datasets and illustrate the interpretability of DeepGSEA by showing how its results can be explained.
Availability and implementation: https://github.com/Teddy-XiongGZ/DeepGSEA
Solving a Class of Non-Convex Minimax Optimization in Federated Learning

Wu, Xidong; Sun, Jianhui; Hu, Zhengmian; Zhang, Aidong; Huang, Heng (December 2023, Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023))

Full Text Available
Federated Conditional Stochastic Optimization

Wu, Xidong; Sun, Jianhui; Hu, Zhengmian; Li, Junyi; Zhang, Aidong; Huang, Heng (December 2023, Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023))

Full Text Available
ProtoCell4P: an explainable prototype-based neural network for patient classification using single-cell RNA-seq

https://doi.org/10.1093/bioinformatics/btad493

Xiong, Guangzhi; Bekiranov, Stefan; Zhang, Aidong (August 2023, Bioinformatics)
Wren, Jonathan (Ed.)
Abstract
Motivation: The rapid advance in single-cell RNA sequencing (scRNA-seq) technology over the past decade has provided a rich resource of gene expression profiles of single cells measured on patients, facilitating the study of many biological questions at the single-cell level. One intriguing research is to study the single cells which play critical roles in the phenotypes of patients, which has the potential to identify those cells and genes driving the disease phenotypes. To this end, deep learning models are expected to well encode the single-cell information and achieve precise prediction of patients’ phenotypes using scRNA-seq data. However, we are facing critical challenges in designing deep learning models for classifying patient samples due to (i) the samples collected in the same dataset contain a variable number of cells (some samples might only have hundreds of cells sequenced while others could have thousands of cells), and (ii) the number of samples available is typically small and the expression profile of each cell is noisy and extremely high-dimensional. Moreover, the black-box nature of existing deep learning models makes it difficult for the researchers to interpret the models and extract useful knowledge from them.
Results: We propose a prototype-based and cell-informed model for patient phenotype classification, termed ProtoCell4P, that can alleviate problems of the sample scarcity and the diverse number of cells by leveraging the cell knowledge with representatives of cells (called prototypes), and precisely classify the patients by adaptively incorporating information from different cells. Moreover, this classification process can be explicitly interpreted by identifying the key cells for decision making and by further summarizing the knowledge of cell types to unravel the biological nature of the classification. Our approach is explainable at the single-cell resolution which can identify the key cells in each patient’s classification. The experimental results demonstrate that our proposed method can effectively deal with patient classifications using single-cell data and outperforms the existing approaches. Furthermore, our approach is able to uncover the association between cell types and biological classes of interest from a data-driven perspective.
Availability and implementation: https://github.com/Teddy-XiongGZ/ProtoCell4P
Full Text Available
Learning for Counterfactual Fairness from Observational Data

https://doi.org/10.1145/3580305.3599408

Ma, Jing; Guo, Ruocheng; Zhang, Aidong; Li, Jundong (August 2023, ACM)

Fairness-aware machine learning has attracted a surge of attention in many domains, such as online advertising, personalized recommendation, and social media analysis in web applications. Fairness-aware machine learning aims to eliminate biases of learning models against certain subgroups described by certain protected (sensitive) attributes such as race, gender, and age. Among many existing fairness notions, counterfactual fairness is a popular notion defined from a causal perspective. It measures the fairness of a predictor by comparing the prediction of each individual in the original world and that in the counterfactual worlds in which the value of the sensitive attribute is modified. A prerequisite for existing methods to achieve counterfactual fairness is the prior human knowledge of the causal model for the data. However, in real-world scenarios, the underlying causal model is often unknown, and acquiring such human knowledge could be very difficult. In these scenarios, it is risky to directly trust the causal models obtained from information sources with unknown reliability and even causal discovery methods, as incorrect causal models can consequently bring biases to the predictor and lead to unfair predictions. In this work, we address the problem of counterfactually fair prediction from observational data without given causal models by proposing a novel framework CLAIRE. Specifically, under certain general assumptions, CLAIRE effectively mitigates the biases from the sensitive attribute with a representation learning framework based on counterfactual data augmentation and an invariant penalty. Experiments conducted on both synthetic and real-world datasets validate the superiority of CLAIRE in both counterfactual fairness and prediction performance.
Full Text Available
Towards Generalized mmWave-based Human Pose Estimation through Signal Augmentation

https://doi.org/10.1145/3570361.3613302

Xue, Hongfei; Cao, Qiming; Miao, Chenglin; Ju, Yan; Hu, Haochen; Zhang, Aidong; Su, Lu (October 2023, ACM)

Full Text Available
